Increase running pod memory limit for rapid_appends to prevent cgroup OOM#1282
Conversation
|
@yaozile123: The label(s) DetailsIn response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
|
Skipping CI for Draft Pull Request. |
There was a problem hiding this comment.
Code Review
This pull request modifies the configureLargeFileResources function in the GCS Fuse integration test suite to set resource requirements for the test pod when performing rapid append tests. Feedback was provided regarding a mismatch in the memory limit, suggesting it be adjusted to 3Gi for consistency with the sidecar container's configuration.
|
[APPROVALNOTIFIER] This PR is NOT APPROVED This pull-request has been approved by: amacaskill, yaozile123 The full list of commands accepted by this bot can be found here. DetailsNeeds approval from an approver in each of these files:Approvers can indicate their approval by writing |
What type of PR is this?
/kind failing-test
What this PR does / why we need it:
The
rapid_appendsGCSFuse integration tests fail with exit code 137 in file cache environments due to reaching the container's memory limit (Killed by cgroup OOM). This happens because the file cache suites default thevolume-testerpod to a1Gilimit, while compiling and executing therapid_appendspackage can peak at around 1Gi of memory usage.This PR updates
configureLargeFileResourcesto ensure thatrapid_appendstests bump thevolume-testerpod to a standard3Gimemory limit, preventing hard limit breaches.Which issue(s) this PR fixes:
Fixes #
Special notes for your reviewer:
Tested on managaed driver with ZB enabled